XReason: A Semantic Approach That Reasons with Patterns to Answer XML Keyword Queries

نویسندگان

  • Cem Aksoy
  • Aggeliki Dimitriou
  • Dimitri Theodoratos
  • Xiaoying Wu
چکیده

Keyword search is a popular technique which allows querying multiple data sources on the web without having full knowledge of their structure. This flexibility comes with a drawback: usually, even though a large number of results match the user’s request only few of them are relevant to her intent. Since data on the web are often in tree-structured form, several approaches have been suggested in the past which attempt to exploit the structural properties of the data in order to filter out irrelevant results and return meaningful answers. This is certainly a difficult task, and depending on the type of dataset, these approaches show low precision and/or recall. In this paper, we introduce an original approach for answering keyword queries called XReason. XReason identifies structural patterns in the keyword matches and reasons with them in order to return meaningful results and to rank them with respect to their relevance. Our semantics shows a non-monotonic behavior and in the presence of additional patterns, it is able to better converge to the users intent. We design an efficient stack-based algorithm for evaluating keyword queries on tree structured data, and we run experiments to evaluate its efficiency and the effectiveness of our semantics as a filtering and ranking system. Our results show that our approach shows better performance than the other approaches in many cases of real and benchmark datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Keyword search across distributed heterogenous structured data sources

Many applications and users require integrated data from multiple, distributed, heterogeneous (semi-) structured sources. Sources are relational databases, XML databases, or even structured Web resources. Mediator systems represent one class of solutions for data integration. They provide a uniform view and uniform way to query the virtually integrated data. As data resides in the local sources...

متن کامل

Keyword Search Interface for Path Queries on Ontology

by SUJEETH THIRUMALAI (Under the Direction of Amit P. Sheth & Lakshmish M. Ramaswamy) ABSTRACT Today’s semantic web has a growing wealth of machine understandable metadata represented using markup languages like RDF, XML or OWL. There exists a plethora of query languages that aid is searching such data models. However, most real world searches involve queries expressed in natural language as it...

متن کامل

Processing XML Keyword Search by Constructing Effective Structured Queries

Recently, keyword search has attracted a great deal of attention in XML database. It is hard to directly improve the relevancy of XML keyword search because lots of keyword-matched nodes may not contribute to the results. To address this challenge, in this paper we design an adaptive XML keyword search approach, called XBridge, that can derive the semantics of a keyword query and generate a set...

متن کامل

Semantic Search over XML Document Streams

A large number of web data sources, such as blogs, news sites and podcast hosts, are currently disseminating their content in the form of streaming XML documents. The variability and heterogeneity of those sources make the employment of traditional querying schemes, which are based on structured query languages, cumbersome for the end user (those languages require precise knowledge of the under...

متن کامل

SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents

Keyword search in XML documents has recently gained a lot of research attention. Given a keyword query, existing approaches first compute the lowest common ancestors (LCAs) or their variants of XML elements that contain the input keywords, and then identify the subtrees rooted at the LCAs as the answer. In this the paper we study how to use the rich structural relationships embedded in XML docu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013